[Experimental][StarCode] KV Cache Injection #2080

dbogunowicz · 2024-02-15T13:11:31Z

Feature Description

The results of my experimentation with the tiny_starcoder model.

Findings:

the original KV cache is being added not as separate arrays: past_key_values.{attn_block_id}.values and past_key_values.{attn_block_id}.keys, but as a join array of keys and values. Did not get to look into breaking those two down, but by analyzing the onnx graph I do not see why we could not do it
the causal mask for this model has different dimensions than what we usually assume. This could be fixed by adding a node after the causal_mask input, that applies the appropriate permutation to the input to patch this.

This is an experimental branch, for which I will, for now, stop the development due to other priorities. To revisit in the future.

jeanniefinks · 2025-05-09T19:05:19Z

Per the main README announcement, SparseML is being deprecated by June 2, 2025. Closing the PR as work has been suspended; thank you for the inputs and support!

dbogunowicz and others added 6 commits February 15, 2024 14:11

Create starcode_kv_cache_injection

f785861

Delete starcode_kv_cache_injection

22d10a7

Add files via upload

c01f768

Add files via upload

8c7f799

producing running (hopefully) but incorrect model

c6ad191

Merge branch 'main' into experimentation

13d4619

jeanniefinks closed this May 9, 2025

jeanniefinks deleted the experimentation branch June 2, 2025 19:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Experimental][StarCode] KV Cache Injection #2080

[Experimental][StarCode] KV Cache Injection #2080

Uh oh!

dbogunowicz commented Feb 15, 2024 •

edited

Loading

Uh oh!

jeanniefinks commented May 9, 2025

Uh oh!

Uh oh!

[Experimental][StarCode] KV Cache Injection #2080

[Experimental][StarCode] KV Cache Injection #2080

Uh oh!

Conversation

dbogunowicz commented Feb 15, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Feature Description

Findings:

Uh oh!

jeanniefinks commented May 9, 2025

Uh oh!

Uh oh!

dbogunowicz commented Feb 15, 2024 •

edited

Loading